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ABSTRACT 

Context. Halo occupation distribution (HODj is a powerful statistic that allows the study of several aspects of the matter distribution 
in the universe, such as evaluating semi-analytic models of galaxy formation or imposing constraints on cosmological models. Con¬ 
sequently, it is important to have a reliable method for estimating this statistic, taking full advantage of the available information on 
current and future galaxy surveys. 

Aims. The main goal of this project is to combine photometric and spectroscopic information using a discount method of background 
galaxies in order to extend the range of absolute magnitudes and to increase the upper limit of masses in which the HOD is estimated. 
We also evaluate the proposed method and apply it to estimating the HOD on the Sloan Digital Sky Survey Data Release 7 (SDSS 
DR7) galaxy survey. 

Methods. We propose the background subtraction technique to mel information provided by spectroscopic galaxy groups and photo¬ 
metric survey of galaxies. To evaluate the feasibility of the method, we implement the proposed technique on a mock catalogue built 
from a semi-analytic model of galaxy formation. Furthermore, we apply the method to the SDSS DR7 using a galaxy group catalogue 
taken from spectroscopic version and the corresponding photometric galaxy survey. 

Results. We demonstrated the validity of the method using the mock catalogue. We applied this technique to obtain the SDSS DR7 
HOD in absolute magnitudes ranging from M = -21.5 to M = -16.0 and masses up to - 10 15 M o throughout this range. On the 
brighter extreme, we found that our results are in excellent agreement with those obtained in previous works. 

Key words, galaxies: haloes - galaxies: statistics - dark matter- large-scale structure of Universe. 


1. Introduction 

The current paradigm assumes that galaxies form by baryon con¬ 
densation within the potential wells defined by the collisionless 
collapse of dark matter haloes ( jWhite & Rees|l978) , but the di¬ 
versity of astrophysical mechanisms involved in galaxy forma¬ 
tion do not allow us to determine how galaxies occupy haloes. 
Clustering studies are powerful tools used to connect galaxies 
to their unseen dark matter haloes and indicate how these were 
assembled. 

Recently, with large galaxy surveys, the halo occupation dis¬ 
tribution (HOD) and conditional luminosity function (CLF) are 
intensively investigated as methods for studying galaxy cluster¬ 
ing. The two methods are closely related, but CLF considers the 
average number of galaxies with luminosity L ± dL/2 that re¬ 
side in a halo of mass Mh (e.g. |van den Bosch et al.| (|2003b|); 
|Yang et ah (|2003|l;|Vale & QstrikeM2004) ; van den Bosch et al.) 
( |2007| >; Va e & Ostriker ( 2006) 1) and HOD describes the prob¬ 
ability distribution P{N\Mh) that a virialized halo of mass M/, 


contains N galaxies with some specified characteristic (e.g. Jing 
et al. ( 1998[>; ling & Boerner| ( 1998 ; Ma & Fry] ( 2000b; |Pea 


cock & Smith 


(j2000) 

Berlind & Weinberg|(]2002); Cooray & Sheth (|2002) ; Berlind et| 

al. i 2003); Zheng et al. i 2005 l|Yang et al.|( 2008)). 

Many authors pointed out the importance of HOD when de¬ 
scribing the relation between galaxies and their hosting dark 


; Seljak (2000); Scoccimarro et al. (2001); 


matter haloes (e.g. |Yang et al.| ( [2007) ; [Yang et al.| ( |200 8 )). This 
description is useful in order to constrain models of galaxy for¬ 
mation and evolution (e.g. |Berlind et al.| ([2003 ); Kravtsov et al. 
(200 4)); it can pr ovide constraints to cosmological models ( |van 
den Bosch et al. ( |2003) ; Zheng & Weinberg ( 2007) ); and it al¬ 
lows the contribution of central and satellite galaxies to be stud¬ 
ied separately (|Kravtsov et al. (2004); Cooray (2005); Zhe ng et| 


ak[( [2505) ; |White et al.| ( |2007) ; |Yang et al.| ( [2008) ). The 


ast prop¬ 
erty contains important information about hierarchical cluster¬ 
ing because the number of satellite galaxies around the central 
galaxy in a dark matter halo provides insights into the formation 
of massive haloes from smaller ones and can be used to con¬ 
strain cosmological parameters. Furthermore, as pointed out by 
Wang & White ( |2012 1 , the overprediction of satellite galaxies 
by numerical simulations called “missing satellite galaxies” is a 
particularly striking example of how satellite galaxy studies can 
constrain cosmology and galaxy evolution. This gives valuable 
information that can be compared with simulations and used to 
constrain semi-analytic models. 


E-mail :facundo @ oac .unc .edu. ar (FR); 


As indicated by Liu et al. ( 2011) , a common method for over¬ 
coming the lack of spectroscopic information for satellite galax¬ 
ies is to count candidate satellites around a bright host in pho¬ 
tometric data and to statistically correct for the contribution of 
foreground and background objects. Consequently a lower limit 
of apparent magnitudes could be accomplished by employing 
the enormous volume of photometric data together with a back¬ 
ground subtraction approach, which is an efficient method for 
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studying the galaxy population on a statistical basis ( [Lares et| 


al. (201 111). Some examples of satellite galaxies at fainter mag¬ 


nitudes combining a redshift survey with a photometric survey 


The proposed method require two catalogues that share the same 
sky area, one for groups and another for galaxies. For each group 
the method requires mass, redshift, angular positions, and a char- 


are 

Holmberg (1969^ 

^hillipps & Shanks (1987]), |Lorrimer et 

acteristic radius, whereas for galaxies only angular positions and 
apparent magnitudes are necessary. 

al. 

1994), Smith et al. 


i, Chen et al. (2006), [Lares et al. 

(20 

i t, Guo et al. (201 

), and Tal et al. (2012)). 

Given that only photometric information is available, to es- 


Most studies have taken advantage of the large volume of 
data provided by the Sloan Digital Sky Survey (SDSS;[York et al. 
( |2000| ) and use both spectroscopic and photometric data to study 
the luminosity function and number density of satellite galaxies 
(e.g.|Chen et al.| ( |2006| ); [Lares et aklP011| ); |Guo et al.| ( |2011[ ); 
|Tal et al.| ( |2012| >) but there are no similar works about HOD. 

In this work we focus on developing the background sub¬ 
traction method as a technique to determine the HOD. Given the 
depth of the Sloan photometric catalogue, this technique will al¬ 
low us to expand the range of limiting absolute magnitude; at 
the same time, more distant galaxy groups will be included. This 
feature will result in group samples with a wider mass range. 

The background subtraction method applied to the HOD de¬ 
termination has not been widely used and is frequently criticized 
for its inability to distinguish between the local and overlying 
overdensities. Consequently, to demonstrate the correct perfor¬ 
mance of our implementation, we have constructed a synthetic 
catalogue from the semi-analytic model of galaxy formation de¬ 
veloped by Guo et al. ( 2010j > applied to the Millennium I Simu¬ 
lation. 

Using galaxy groups provided by Yang et al. ( 2012j ) taken 
from the spectroscopic version of the SDSS DR7 survey and 
galaxies from the corresponding photometric galaxy catalogue, 
we estimate the HOD for different limiting absolute magnitudes 
and the associated parameterizations according to Zheng et al. 
p'0'051 . These results are compared with the results of Yang et 
al. (2008 ), who - based on the same formalism and using the 
SDSS data release 4 group catalogue - determine the parame¬ 
ters of the HOD for different absolute magnitudes using volume 
limited samples. 

This paper is organized as follows. In section [2] we explain 
the procedure. Section [3] describes real and simulated data. The 
test of the method is presented in section [4] Section [5] shows the 
HOD of the SDSS DR7 estimated using our method. Finally, our 
results are summarized in section[6] 


2. Background subtraction method to estimate the 
HOD 

For the background subtraction procedure it is assumed that 
large scale galaxy distribution is uniform, while groups are lo¬ 
cal overdensities. As is explained below, our method statistically 
estimates the number of galaxies that are in front of and/or be¬ 
hind the group (hereafter background galaxies), but, in projec¬ 
tion, seem to belong to it. Given that this technique allows galaxy 
systems taken from spectroscopic surveys to be combined with 
catalogues without redshift information, it is possible to estimate 
the HOD in a wider range of magnitudes and at the same time 
improve the statistics. The main contribution of this method is 
the ability to determine the HOD of satellite galaxies in fainter 
magnitudes than those studied until now. 

The background subtraction technique involves counting ob¬ 
jects in a region where there is a known signal superposed on an 
uncorrelated noise, and subtracting a statistical estimation of that 
noise. In our case, the signal is conformed by the overdensities 
where the galaxy groups lie, meanwhile the noise is associated 
with the background galaxies that do not belong to the group. 


timate the HOD we compute the absolute magnitude assuming 
that all galaxies are located at the same redshift of the group z gr . 


M = m- 25-5 log (d L (z gr )), 


(1) 


where £//.(z?r) is the luminosity distance in Mpc at z gr - 

Then we define n, as the number of galaxies that have an 
absolute magnitude M < Mn,„ within a circle centred in each 
group with a radius determined by the corresponding projected 
characteristic radius on the sky. The value of M/„„ sets the upper 
limit for the range of absolute magnitudes where the HOD will 
be estimated. 

Because it is not possible to distinguish between galaxies be¬ 
longing to the group and background galaxies, we need some 
way to know the local number of background galaxies in order 
to discount it from n,. The contribution of the background galax¬ 
ies to tij cannot be determined straightforwardly; consequently, a 
statistical method is required. Taking into account the hierarchi¬ 
cal behaviour of the large scale structure, it is known that a given 
overdensity is always immersed in a bigger structure; therefore, 
we estimate the background contribution by counting the num¬ 
ber of galaxies, n„, that meet our selection criteria (M < M/, m ) in 
an annulus around each galaxy group instead of taking the aver¬ 
age density over the whole catalogue. The number of galaxies N 
can be estimated subtracting the local background density mul¬ 
tiplied by the projected area for each group (i.e. the background 
contribution to this area), 


N = iii - —A ,, 
A n 


( 2 ) 


where A, and A a are the projected areas of a circle centred on the 
group and the corresponding annulus, respectively. 

Since the method has no restrictions on the choice of groups, 
we should be careful when choosing the samples used in each 
measurement. Specifically, low mass groups cannot be detected 
at high redshifts because they typically contain weaker galaxies, 
while high mass groups can be observed in the whole range of 
redshift. To consider this fact, we perform different cuts in red¬ 
shift for each mass range such that at least the brightest galaxy 
in each group in this mass range is visible. 

Finally, groups are binned in mass intervals to obtain the 
HOD as the average of N, (N\M h ). It should be taken into ac¬ 
count that the groups used in this method should be near enough 
so that a galaxy with absolute magnitude equal to M/,„, at the 
group redshift has an apparent magnitude brighter than the lim¬ 
iting apparent magnitude of the photometric catalogue. 


3. Data 

To test the described method we take advantage of simulated 
data, while to obtain an estimation of the HOD we use the 
SDSS DR7 galaxy catalogue. In this work, as in Millennium I 
Simulation, we adopt the cosmological parameters taken from 
WMAP1 data ( [Spergel et al.|2003| , i.e. a flat cosmological model 
with a non-vanishing cosmological constant (ACDM): 12,„=0.25, 
Q/,=0.045, G a =0.75, ct 8 =0.9, n= 1, and /t=0.73. 
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3.1. SDSS Catalogues 


The Sloan Digital Sky Survey is a photometric and spectroscopic 
survey constructed with a dedicated 2.5 m telescope at Apache 
Point Observatory in New Mexico. The main advantage of this 
work is to combine photometric and spectroscopic data, and so 
we will take advantage of the full set of data. In particular, we 
employ the Legacy footprint area which covers more than 8400 
deg 2 in five optical bandpasses. Imaging data have 230 million 
objects with apparent magnitudes in the range of the r-band of 
less than 22.2, and spectroscopic data provide approximately 
900,000 spectra of galaxies with redshift measurements up to 
z < 0.3 and an upper apparent magnitude limit of 17.77 in the 
r-barul ( [Abazajian et al.|2009j ). 

The groups used in our analysis are taken from the SDSS 
galaxy group catalogue of Yang et ak| ( |2007| l, constructed us¬ 
ing the adaptive halo-based group finder presented injYang et al. 
( |2005| >, but updated to DR7 ( Yang et al.|2012[). Unlike the tradi 


tional friend-of-friends (FOF) methods (e.g. Merchan & Zandi- 
varez ( |2005[ >), a property of this group finder is the ability to 
identify systems even when they have only a single member, 
which increases the number of groups and at the same time al¬ 
lows a wider dynamic range of masses. We also use this group 
catalogue because it will allow us to compare our results with 
those obtained previously by the same author. 


3.2. Simulated data 

In order to test our method, we construct a mock catalogue us¬ 
ing synthetic galaxies extracted from a semi-analytic model of 
galaxy formation applied on top of the Millennium Run Simula¬ 
tion I. 

The Millennium Simulation is a cosmological N-body sim¬ 
ulation (Sprin ge! et al.||2005[ >, which offers high spatial and 
time resolution within a large cosmological volume. This sim¬ 
ulation evolves more than 10 billion dark matter particles in a 
500 lr x Mpc periodic box, using a comoving softening length of 
5 !r x kpc. Given that the Millennium Simulation is dark matter 
only, it can be populated with galaxies using a semi-analytic 
galaxy formation model. For our synthetic catalogue we use the 
model developed by Guo et alj ( |2010) > publicly available at the 
Millennium Database []] We also take from this database the in¬ 
formation about the dark matter halo where galaxies reside. 

To build our mock catalogue, we place the observer at the 
origin of the Millennium simulation box and, taking account of 
the periodicity, we simply repeat the simulated volume as many 
times as necessary to reproduce the volume of the photometric 
version of the SDSS DR7 catalogue. The redshifts are calcu¬ 
lated from the distances to the observer taking into account the 
distortion produced by proper motion and the cosmological pa¬ 
rameters used in the simulation. The flux limited selection of the 
mock catalogue is then obtained by imposing the same upper 
apparent magnitude threshold that affects the SDSS catalogue. 
In order to determine the angular selection function of the sur¬ 
vey, we use the mask described in section [5] In addition, a mock 
group sample is produced from the mock catalogue keeping the 
original dark matter halo membership used in the simulation. 


4. Test of the method 

The aim of this section is to demonstrate the effectiveness of 
the proposed technique. For this purpose, we use the mock cata- 

1 http://gavo.mpa-garching.mpg.de/MyMillennium 


logue described above where we know in detail the population of 
galaxies in dark matter haloes, which can be directly compared 
to the results obtained with our procedure. 

As is known, flux limited samples do not provide enough in¬ 
formation to determine a representative size of groups because 
the number of visible members depends on the distance. Conse¬ 
quently, we use the semi-analytic model to set a relation between 
the characteristic radius of groups and their masses, which can 
be estimated using the visible members. For our purposes, we 
define the characteristic radius as the distance from the centre 
of the host halo to the most distant galaxy member. Assuming 
a linear behaviour between mass and volume, we propose the 
relation 

r c = C\m\ - C 2 , (3) 

where r c is the characteristic radius and M/, is the host halo mass. 
The parameters C\ and C 2 are determined by fitting the semi- 
analytic data using a simple chi-square minimization technique, 

obtaining C\ = 4.31 x 10 s Mpc M Q 3 and C 2 = -0.33 Mpc. 
To compute the background contribution as described in section 
[2] the internal and external radius of the annulus are chosen as 
r c + 1 Mpc and r c +2Mpc, respectively. We found that the results 
are not sensitive to the size of this annulus, allowing us to choose 
a compromise between the influence of the local overdensity and 
statistical robustness. 

For each absolute limiting magnitude we apply our method 
of determining the HOD on the simulated data and compare the 
values with the HOD obtained by simply counting the number of 
galaxies in the simulated box. Figure[l]shows the results for dif¬ 
ferent absolute limiting magnitudes ranging from M\ im — -21.5 
to Mu,,, = -16.0. As can be seen, beyond a small overpredic¬ 
tion for fainter magnitudes, there is a good agreement between 
our results (shaded grey bands) and those obtained by directly 
counting galaxies in the simulated box (solid lines) in all ranges 
of absolute magnitudes. We have also included in this figure the 
HOD corresponding to the volume limited samples to point out 
the ability of our method to achieve higher masses at low lumi¬ 
nosities. 

As mentioned in section [2j it is necessary to restrict the 
group samples to avoid any bias produced by the fact that mas¬ 
sive groups can be observed at higher distances than lower mass 
groups. In the case of the mock catalogue we studied the distri¬ 
bution of the brightest galaxy of each group and we found that 
for groups with masses < 3.16 x 10 13 M Q it is enough to take a 
maximum redshift of z = 0.11 given that the tail of the brightest 
galaxy distribution lies approximately at M ^ -19.5. For higher 
masses a redshift cut is not necessary. Furthermore, for abso¬ 
lute magnitudes fainter than M — -18 the group sample should 
be restricted to a redshift determined by the limiting apparent 
magnitude of the photometric catalogue (e.g. for the photometric 
SDSS that have an /--band limiting apparent magnitude of - 21.5 
and for M/,„, = -17.5, there is a limiting redshift of z — 0.15); 
even so, these samples are much larger than the corresponding 
volume limited samples (following the previous example, with 
the corresponding spectroscopic apparent limiting magnitude of 
17.77 and the same limiting absolute magnitude, the resulting 
maximum redshift is - 0.04). For this reason, our results cover 
a wider range of masses than direct measurements; this effect is 
more pronounced as the absolute magnitudes become fainter, as 
can be seen in figure [T] 

A usual critique to the background subtraction method is its 
inability to evaluate the contamination produced by background 
and foreground overdensities, which cannot be distinguished in 
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Fig. 1 . Comparison between HOD obtained by direct counts in volume 
limited samples (dotted lines), by the background subtraction method 
(shaded grey bands), and in the simulated box (solid lines). From bot¬ 
tom to top, we show the results for M Um = -21.5, -21.0, -20.5, -20.0, 
-19.0, -18.0, -17.0, and -16.0. The errors are computed using the 
standard jackknife procedure. As can be seen, these results suggest that 
our method is in good agreement with direct estimation and supports 
its robustness. It should be taken into account that the proposed method 
uses both photometric and spectroscopic catalogues, while direct esti¬ 
mation only uses spectroscopic data. 


photometric catalogues. In order to estimate this effect and tak¬ 
ing advantage of the mock catalogue, we compute the HOD, dis¬ 
tinguishing between groups with and without other groups inside 
the circle defined by their projected characteristic radius. We find 
that approximately 17% of groups, by projection effects, have 
one or more of these contaminating groups. This percentage is 
homogeneously distributed over the entire range of masses. Fig¬ 
ure [2] shows the results for all groups (solid line), groups with 
contamination (dotted line), and without contamination (dashed 
lines) for M\ im = -19.0. As can be seen, the HOD of groups with 
contamination is overestimated for low masses, while groups 
without contamination reproduce a similar behaviour to the total 
sample, i.e. contamination effects do not contribute significantly 
to the background subtraction estimation of total HOD due to 
the small fraction of contaminated groups. On the other hand, 
the HOD does not seem to be affected for higher masses. This 
is important given that the parameters of the Yang et al. (2008 1 
model are principally determined by the values of the HOD in 
this region. 

Another possible source of uncertainty could be the typical 
error in halo mass estimation. To test the impact of these uncer- 
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Fig. 2. Projection effects in the determination of HOD by the back¬ 
ground subtraction method. The dotted line shows the HOD of groups 
that have, in projections, other groups in the characteristic radius; the 
dashed line corresponds to the HOD of groups without contamination; 
and the solid line is the HOD of all groups. A standard jackknife pro¬ 
cedure is applied to estimate errors. For the sake of clarity we only plot 
the results corresponding to M/, m = -19.0. The bottom panel shows the 
quotient between the HOD estimated with and without contamination 
and the HOD of all groups. 


tainties on our method, we introduce an artificial Gaussian dis¬ 
persion around the simulated halo masses of 0.2 dex, which is 
the maximum observed value in the halo mass-number of mem¬ 
bers relation. This value corresponds to the lower end of mass 
distribution being much smaller for higher masses. For the sake 
of clarity, we only show in figure [3] the results corresponding to 
Mum = -19.0. As can be seen, the uncertainty in halo masses is 
not significant in the estimation of HOD. 

Throughout this work, we have used the geometric centres of 
the groups to estimate the HOD. Thus, to complete the evalua¬ 
tion of the method we also consider the impact of a possible off¬ 
set of the group centres. In figure[3]we also include the HOD es¬ 
timated using the brightest galaxy as centre, demonstrating that 
the method used to estimate the centres of the groups does not 
affect our results. 

As a conclusion to this section, we could argue that the HOD 
estimation using our technique is not affected by projection ef¬ 
fects and possible biases in the mass and positions estimates. 
Furthermore, the results using our method are in excellent agree¬ 
ment with direct count measurements. 


5. Implementation of the method 

5.1. SDSS DR7 HOD 


We now turn to implementing the proposed method to the SDSS 
DR7 catalogue using the data described in Sect. 3.1 Since the 
catalogue does not cover the whole sky, it is possible that the 
areas A a and A, lie partially out of the catalogue. To take into 
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Table 1. Comparison between parameters of HOD for satellite galaxies 
obtained by jYang et al.| ( j2068) and by our method. 


Mum 

^TCO 

-16.5 

-17.0 

-17.5 

-18.0 

-18.5 

-19.0 

-19.5 

- 20.0 

-20.5 

- 21.0 

-21.5 


Yang et al. (2008) 


M n 


a 


12.48 + 0.04 1.01 +0.06 
12.62 + 0.02 1.05 + 0.04 
12.77 + 0.02 1.06 + 0.03 

12.93 + 0.01 1.07 + 0.02 

13.15 + 0.01 1.09 + 0.02 

13.44 + 0.01 1.10 + 0.02 

13.82 + 0.01 1.13 + 0.02 

14.34 + 0.01 1.33 + 0.07 


Background subtraction 
My a 


11.9 

± 

0.6 

0.97 

± 

0.03 

12.0 

+1 

0.5 

0.98 

+i 

0.03 

12.1 

+1 

0.5 

0.99 

+i 

0.03 

12.2 

+1 

0.5 

1.00 

+i 

0.03 

12.3 

+1 

0.6 

1.01 

+i 

0.03 

12.4 

+1 

0.6 

1.03 

+i 

0.03 

12.6 

+1 

0.6 

1.04 

+i 

0.03 

12.8 

+1 

0.6 

1.05 

+i 

0.03 

13.0 

+1 

0.7 

1.06 

+i 

0.05 

13.3 

+1 

0.7 

1.11 

+i 

0.05 

13.7 

+1 

0.7 

1.13 

+i 

0.05 

14.2 

+1 

0.7 

1.33 

+i 

0.05 


al. (|2011]) semi-analytical model was not specifically made to 


reproduce this statistic, a comparison between SDSS DR7 HOD 
(solid lines) and mock HOD (grey lines) in figure [4] shows the 
semi-analytic results to be slightly higher than the corresponding 
values from the SDSS. This discrepancy suggests that the HOD 
could be used to improve the semi-analytical model. 


Fig. 3. Effects of variations in mass and positions of groups. The dotted 
line corresponds to the groups with an artificial Gaussian dispersion 
around the simulated halo masses and the dashed line shows the HOD 
of groups using the brightest galaxy as centre. The solid line is the HOD 
of groups in the mock, with geometric centre and masses taken from the 
simulation, as in previous plots. Errors are determined using a standard 
jackknife procedure. These results correspond to M/„„ = -19.0. The 
bottom panel shows the quotient between the HOD estimations. 


account the catalogue geometry, we use the routines provided 
by the SDSSPix software specifically designed for the SDSS ge¬ 
ometry (see e.g. Swanson et al.ll ( 2008| ). To determine the an¬ 
gular selection function of the survey, we divide the sphere into 
^ 7,700,000 equal area pixels. We then compute an initial bi¬ 
nary mask by setting to 1 all pixels with at least one galaxy 
inside, and 0 the remaining pixels. Given that the pixel size is 
much smaller than the mean inter-galaxy separation, we smooth 
the initial mask by averaging each pixel over its 7 x 7 adjacent 
neighbours. We compound the angular selection function as the 
set of all pixels with a value that exceeds a given threshold. To 
obtain an estimation of A, and A 0 as precisely as possible, we 
perform a second equal area pixelization with a higher resolution 
(1,962,900,000 pixels). The corresponding areas will be propor¬ 
tional to the number of pixels lying in the intersection between 
the mask and the circles or the annuluses, respectively. 

In this case the brightest galaxy distribution is very similar 
to the mock catalogue, consequently we apply the same restric¬ 
tions in redshift according to the mass. The only difference is for 
higher masses where the tail of the distribution is at M ^ -21.0, 
which corresponds to a redshift of z — 0.17. 


In figure[4]we show the HOD obtained for six ranges of limit¬ 
ing absolute magnitudes as indicated in each panel. The expected 
behaviour of these statistics can be seen, i.e. the decrease in the 
number of galaxies at brighter magnitudes and the characteristic 
decay of the HOD described by |Zheng et al.| ( [2005| ) (only visible 
in the bottom panels). 


Although the results of the semi-analytical model cannot be 
compared directly with those for the Sloan because the Guo et 


5.2. Comparison with previous results 


To compare our results with those of jYang et al.| ( |2008| ) for SDSS 
data release 4, we use the parameterization of the satellite galax¬ 
ies distribution proposed by these authors: 


(N s \M h ) = 


Mh_ 

M ( , 


(4) 


were M 0 is the characteristic mass of a halo above which there is 
on average at least one satellite, and a is the power-law index. In 
figure[4]we have included the fit for each range of limiting abso¬ 
lute magnitude (dotted lines) and in figure[5]we plot the best fit 
parameters as a function of the limiting absolute magnitude (grey 
circles) together with the corresponding to Yang et al. ( |2008| l. 
As can be seen, there is an excellent agreement between our re¬ 
sults and previous results. Furthermore, this figure makes clear 
an important feature of our method: the ability to estimate the 
HOD for fainter absolute magnitudes than the previous works, 
the shaded area in the figure shows the new values obtained with 
this technique. For completeness and to clarify, the information 
presented in figure |5lis also compiled in table 1. 

Weinmann et al. ( j2011 ( i collects information for four well 
studied galaxy clusters (Fornax, Virgo, Perseus, Coma) that can 
be confronted directly with our results. Figure[6]shows the abun¬ 
dance of these clusters for two ranges of limiting absolute mag¬ 
nitudes taken from Weinmann et al j( |2011| > and the HOD in mass 
bins centred in the mass of these clusters obtained applying our 
method to the Yang et al. ( j2012[ ) group sample and the photomet¬ 
ric version of the SDSS catalogue. The size of the errors associ¬ 
ated with the largest mass is a logical outcome of that the func¬ 
tion was estimated using 9 groups in the case of M/,„, = -16.7 
and 14 groups in the case of M/, m = -19.0. The abundances 
of Fornax, Virgo and Perseus seem to be in agreement with the 
mean number of galaxies for systems of the same masses, mean¬ 
while the Coma cluster shows a population significantly smaller. 


While this seems to be in contraposition with the results of Wein- 
|mann et ak| ( |2011| ), it should be noted that these authors make the 
comparison with a single cluster taken from the semi-analytical 
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Fig. 4. HOD obtained for SDSS DR7 implementing background subtraction method for different limiting absolute magnitude (M ; „„ = 
-16.0,-17.0,-18.0,-19.0, -20.0 and -21.0 as indicated in each panel). We show in solid lines the total HOD while solid circles show the satellite 
distribution. Dotted lines correspond to the best linear fit of satellite distribution. Just for comparison, we included the occupation corresponding 
to the synthetic catalogue estimated in the previous section (gray lines). Error bars are computed using the standard jackknife procedure. 


model of Guo et al. ( 2011| , which does not necessarily represent 
the behaviour of the corresponding mass media. 


6. Conclusions 

In this paper we present a statistical approach to estimating the 
HOD based on the background subtraction technique that com¬ 
bines a spectroscopic galaxy group catalogue with a photomet¬ 
ric galaxy survey. Given the nature of the photometric catalogue 


our method is able to deal with large volumes, consequently the 
HOD can be estimated in a wider range of masses even when 
faint absolute magnitudes are considered. A further consequence 
of the use of a photometric galaxy catalogue is the ability to mea¬ 
sure weaker magnitudes than was possible when using spectro¬ 
scopic information alone. 


Our method was tested using mock catalogues constructed 


from a semi-analytic model of galaxy formation (Guo et al. 


|2010|> applied on top of the Millennium I Simulation. We show 
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Fig. 6. Comparison between galaxy abundances of four observed clus¬ 
ters taken from Weinmann et al7]< |20 1 1} (triangles) and the correspond¬ 
ing results obtained with discount method (circles) in mass bins cen¬ 
tred in these clusters. Here, we use masses of 6 x 10 13 M o for Fornax, 
1.7 x 10 14 M o for Virgo, 6.7 x 10 I4 M G for Perseus and 1.3 x 10 l5 M o for 
Coma. Top panel show the abundances for M Um = -16.7 and bottom 
panel for Mi im = -19.0. 


Fig. 5. Parameters of HOD for satellite galaxies (see equation [4|. Top 
and bottom panels show M 0 and a parameters, respectively, as a func¬ 
tion of the limiting magnitude. Our results are shown in grey circles 
while black circles correspond to results of[Yang et al. |2008) l. There 
is a good agreement between both results, furthermore our method al¬ 
lows to extend the range of absolute magnitude to fainter values up to 
Mu m = -16.0 without losing statistical strength (shaded area). Parame¬ 
ters and error bars correspond to the best values obtained following the 
X~ Minimization Method. 


that the background subtraction technique is able to recover the 
HOD resulting from the model and from the mock catalogue us¬ 
ing a direct count in volume limited samples as described by 
Yang et al. ( 2008) 1. Additionally, we studied projection effects 
as a source of possible bias as well as the impact of errors on 
the mass and centre estimates. We find that only 17% of groups 
have one or more overlapping projection groups resulting in a 
statistically unnoticeable distortion with respect to the model; in 
addition, the introduction of variations in the mass and positions 
of the groups does not significantly affect our results. 

We obtained the HOD of SDSS DR7 in twelve ranges of lim¬ 
iting absolute magnitudes which expand from -16.0 to -21.5 in 
Sloan r-band. This estimation was performed using the groups 
catalogue provided by |Yang et ah) ( |2012| ) and the SDSS DR7 
photometric catalogue. Our results are consistent with those ob¬ 
tained by Yang et al.]( 20081, and given the nature of our method, 
the samples of groups used to estimate the HOD are deeper and 


more numerous, allowing us to extend the range of masses be¬ 
yond the limits explored in previous works and also to achieve 
absolute magnitudes as faint as M/, m - 16.0. 

The proposed technique is able to combine two catalogues, 
one of groups and the other of galaxies, that occupy the same 
sky area independently of the origin of the group catalogue and 
the photometric bandpass of the galaxy survey. Its features make 
it particularly useful for studying the behaviour of the HOD as 
a function of different properties of groups or galaxies and will 
allow us to fully exploit the benefits of the new generations of 
galaxy catalogues. Possible applications could be the study of 
the population of dwarf galaxies (in the range of M\ im = -16.0 
to -19.0) or constraints on semi-analytical models. 

Finally, we will use the results of this work to extend the 
anisotropic halo model developed by |Sgro et al. ( 2013) 1 in order 
to evaluate the ability of this model to reproduce observational 
results. 
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